History: XML dumps and research needs for WMF projects


  • Session time: Thursday, July 8, 2010, 9 AM
  • Facilitator: Ariel Glenn
  • Participants: Kevin Crowston, Victor Grishchenko, Daniel Kinzler, Roan Kattouw, Andreea Gorbatai

Discussion topics:
  • Proposals for new information to include in the XML dumps, and for ways to segment the dumps into smaller chunks or produce samples
  • Types of usage statistics people want to see produced, including navigation path statistics
  • Proposal to collect and provide search terms from Lucene and Google searches, and to track search successes and failures
  • A shared collaboration and computing platform for researchers, used to share dumps, samples, tools, and research results and to provide disk space and computing power

There will be a wiki page at www.mediawiki.org; see the user page User:ArielGlenn there for a link.

History

Date                              User         Version
Thu 08 of July, 2010 06:07 EDT    ArielGlenn   3 (current)
Thu 08 of July, 2010 04:52 EDT    ArielGlenn   2
Thu 08 of July, 2010 04:43 EDT    ArielGlenn   1